Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
5204 | 10 | 2,5 |
6171 | 8 | 1,5 |
8740 | 5 | 0,5 |
10403 | 4 | 3,6 |
12770 | 3 | 1,000 |
12771 | 3 | 1,3 |
12829 | 3 | 2,6 |
12849 | 3 | 3,3 |
12850 | 3 | 3,5 |
16805 | 2 | 0,2 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
27501 | 1 | 2T-C(1600 |
27647 | 1 | 35(37 |
27710 | 1 | 3T-C(1800 |
27715 | 1 | 4(empat |
27881 | 1 | 5(Kepemimpinan |
28114 | 1 | 6–7(8 |
28124 | 1 | 7-6(3 |
28239 | 1 | 7–6(4 |
28445 | 1 | A321-200(EI-LVA |
28825 | 1 | Aftermath(1966 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
27128 | 1 | 2)Permanen |
27223 | 1 | 2/3)e |
27514 | 1 | 3)Dari |
27716 | 1 | 4)Melalui |
27882 | 1 | 5)Askogonium |
30596 | 1 | Bass),Zens |
31796 | 1 | CH(OH)COOH,EDTAH |
33015 | 1 | D.Roger).Setelah |
36371 | 1 | Guitar),Hamam |
36372 | 1 | Guitar),Liphen |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
26330 | 1 | 10%—30 |
27227 | 1 | 20%-30 |
27574 | 1 | 30%- |
27575 | 1 | 30%—100 |
28218 | 1 | 78%-81 |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
4942 | 11 | R&B |
14256 | 3 | P&G |
21203 | 2 | S&M |
21775 | 2 | T&T |
28426 | 1 | A&M |
28427 | 1 | A&R |
28566 | 1 | AT&T |
36455 | 1 | H&M |
40812 | 1 | Lilo & Stitch |
41174 | 1 | M&C |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
22110 | 2 | US$10 |
28550 | 1 | AS$0,25 |
28551 | 1 | AS$100 |
28552 | 1 | AS$120 |
28553 | 1 | AS$2 |
28554 | 1 | AS$2.000 |
28555 | 1 | AS$50 |
28556 | 1 | AS$55 |
36495 | 1 | HK$13,33 |
43199 | 1 | NT$40 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
26133 | 1 | 00"-118°48 |
26134 | 1 | 00"-8°30 |
27055 | 1 | 1971:"The |
54069 | 1 | bebas"-nya |
57897 | 1 | ibu"nya |
58366 | 1 | julukan"Le |
60084 | 1 | makna:"Istana |
60593 | 1 | mempromosikan,"katanya |
61390 | 1 | mereka,"I |
64315 | 1 | rumah"(pemanas |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
14225 | 3 | O'Brien |
18885 | 2 | Hapoel Be'er Sheva |
19253 | 2 | John's College |
20409 | 2 | O'Higgins |
21243 | 2 | Sa'ad |
31314 | 1 | Borgofranco d'Ivrea |
32423 | 1 | Christ's College |
32726 | 1 | Conan O'Brien |
32838 | 1 | Cosimo de' Medici |
35069 | 1 | Fiddlin' John Carson |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
17523 | 2 | B-B+B-B |
18664 | 2 | GMT+14 |
32932 | 1 | Ctrl+KC |
51365 | 1 | U+02DA |
51458 | 1 | UTC+6 |
51459 | 1 | UTC+8 |
51974 | 1 | Visual C++ |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
34770 | 1 | F*ck |
47342 | 1 | SM*SH |
47343 | 1 | SM*Sh |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
5405 | 10 | dan/atau |
7626 | 6 | 2002/2003 |
8556 | 6 | pemain/pasangan |
9894 | 5 | km/jam |
10077 | 5 | m³/detik |
12833 | 3 | 2009/2010 |
13481 | 3 | GNU/Linux |
13558 | 3 | HIV/AIDS |
14227 | 3 | OS/2 |
15683 | 3 | jiwa/km² |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have a very low frequency, there might be a problem in preprocessing.
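
The check described above can be sketched in a few lines of Python. This is only an illustration: the function name `suspicious_words`, the threshold `max_freq`, and the choice of which characters count as forbidden are assumptions made for the example and are not part of the report. The rows are taken from the result tables of this subsection.

```python
from typing import Iterable, List, Set, Tuple

Row = Tuple[int, int, str]  # (rank in wordlist, frequency, word)


def suspicious_words(rows: Iterable[Row], forbidden: Set[str],
                     max_freq: int = 1) -> List[Row]:
    """Return rows whose word contains a forbidden character and whose
    frequency is above max_freq (a hypothetical threshold)."""
    flagged = []
    for rank, freq, word in rows:
        if freq > max_freq and any(ch in forbidden for ch in word):
            flagged.append((rank, freq, word))
    return flagged


# A few rows copied from the tables of this subsection.
ROWS = [
    (5204, 10, "2,5"),
    (27501, 1, "2T-C(1600"),
    (27128, 1, "2)Permanen"),
    (4942, 11, "R&B"),
]

if __name__ == "__main__":
    # Assumption: brackets are forbidden inside words, "," and "&" are allowed.
    print(suspicious_words(ROWS, forbidden=set("()")))
    # Assumption: the comma is forbidden as well.
    print(suspicious_words(ROWS, forbidden=set("(),")))
```

With brackets as the only forbidden characters nothing is flagged, because every bracketed word occurs only once. Treating the comma as forbidden flags "2,5", which is expected for a language that uses the comma as decimal separator and shows why the forbidden set has to be chosen per language.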
Query used for words containing %:
select w_id-100, freq, word from words where w_id>100 and word like "%\%%" limit 10;
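
The same query can be repeated for each special character. Since % and _ are wildcards in LIKE patterns, they have to be escaped, as the backslash in the query above does for %. The following is a minimal sketch using Python's `sqlite3` module with an in-memory table. The column layout of `words` is inferred from the query, the helper names `like_pattern`, `words_containing` and `demo_connection` are hypothetical, and SQLite needs an explicit ESCAPE clause where MySQL treats the backslash as escape character by default.

```python
import sqlite3

SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch: str) -> str:
    """Build a LIKE pattern matching words that contain ch.
    The LIKE wildcards % and _ (and the escape character itself)
    are prefixed with a backslash."""
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return f"%{ch}%"


def words_containing(conn: sqlite3.Connection, ch: str, limit: int = 10):
    """Return (rank, freq, word) rows for words containing ch."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def demo_connection() -> sqlite3.Connection:
    """In-memory table with a few rows from this subsection.
    Assumption: w_id is the rank plus 100, as the query suggests."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE words (w_id INTEGER, word TEXT, freq INTEGER)")
    conn.executemany(
        "INSERT INTO words VALUES (?, ?, ?)",
        [
            (5304, "2,5", 10),
            (27327, "20%-30", 1),
            (5042, "R&B", 11),
            (5505, "dan/atau", 10),
            (18764, "GMT+14", 2),
        ],
    )
    return conn


if __name__ == "__main__":
    conn = demo_connection()
    for ch in SPECIAL_CHARS:
        print(repr(ch), words_containing(conn, ch))
```

Without the escaping step the pattern for _ would be `%_%`, which matches every non-empty word, so the list for that character would simply repeat the top of the wordlist.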